Genome-Wide Identification, Localization, and Expression Analysis of Proanthocyanidin-Associated Genes in Brassica
نویسندگان
چکیده
Proanthocyanidins (PA) is a type of prominent flavonoid compound deposited in seed coats which controls the pigmentation in all Brassica species. Annotation of Brassica juncea genome survey sequences showed 72 PA genes; however, a functional description of these genes, especially how their interactions regulate seed pigmentation, remains elusive. In the present study, we designed 19 primer pairs to screen a bacterial artificial chromosome (BAC) library of B. juncea. A total of 284 BAC clones were identified and sequenced. Alignment of the sequences confirmed that 55 genes were cloned, with every Arabidopsis PA gene having 2-7 homologs in B. juncea. BLAST analysis using the recently released B. rapa or B. napus genome database identified 31 and 58 homologous genes, respectively. Mapping and phylogenetic analysis indicated that 30 B. juncea PA genes are located in the A-genome chromosomes except A04, whereas the remaining 25 genes are mapped to the B-genome chromosomes except B05 and B07. RNA-seq data and Fragments Per Kilobase of a transcript per Million mapped reads (FPKM) analysis showed that most of the PA genes were expressed in the seed coat of B. juncea and B. napus, and that BjuTT3, BjuTT18, BjuANR, BjuTT4-2, BjuTT4-3, BjuTT19-1, and BjuTT19-3 are transcriptionally regulated, and not expressed or downregulated in yellow-seeded testa. Importantly, our study facilitates in better understanding of the molecular mechanism underlying Brassica PA profiles and accumulation, as well as in further characterization of PA genes.
منابع مشابه
Genome-wide Association Study to Identify Genes and Biological Pathways Associated with Type Traits in Cattle using Pathway Analysis
Extended Abstract Introduction and Objective: Type traits describing the skeletal characteristics of an animal are moderately to strongly genetically correlate with other economically important traits in cattle including fertility, longevity and carcass traits. The present study aimed to conduct a genome wide association studies (GWAS) based on gene-set enrichment analysis for identifying the ...
متن کاملIdentification of Prognostic Genes in Her2-enriched Breast Cancer by Gene Co-Expression Net-work Analysis
Introduction: HER2-enriched subtype of breast cancer has a worse prognosis than luminal subtypes. Recently, the discovery of targeted therapies in other groups of breast cancer has increased patient survival. The aim of this study was to identify genes that affect the overall survival of this group of patients based on a systems biology approach. Methods: Gene expression data and clinical infor...
متن کاملThe Genetics of Non-Syndromic Primary Ovarian Insufficiency: A Systematic Review
Purpose: Several causes for primary ovarian insufficiency have been described, including iatrogenic and environmental factor, viral infections, chronic disease as well as genetic alterations. Given the large number of genes described in the literature so far, the aim of this review was to collect all the genetic mutations associated with non-syndromic primary ovarian insufficiency. Methods: All...
متن کاملPapaya Dieback in Malaysia: A StepTowards A New Insight of Disease Resistance
A recently published article describing the draft genome of Erwiniamallotivora BT-Mardi (1), the causal pathogen of papaya dieback infection in Peninsular Malaysia, hassignificant potential to overcome and reduce the effect of this vulnerable crop (2). The authors found that the draft genome sequenceis approximately 4824 kbp and the G+C content of the genomewas 52-54%, which is very similarto t...
متن کاملIdentification and Expression Analysis of Two Arabidopsis LRR-Protein Encoding Genes Responsive to Some Abiotic Stresses
AbstractTwo Arabidopsis thaliana genes, psr9.2 and psr9.4 appearedto be highly similar to a phosphate-starved induced gene,psr9, isolated from Brassica nigra suspension cells.Sequence analysis classified the encoded polypeptides asmembers of leucine-rich repeat (LRR) proteins superfamily.The sequence of psr9 proteins comprise a unique N-terminalregion e...
متن کامل